May 17, 2016, 06:08 PM // 18:08
|
#1
|
Desert Nomad
Join Date: Mar 2008
Location: I would never play off Occupy Wall St. for my guild name
Guild: We Are the 1 Percent
Profession: Me/
|
Saving Threads
Hi All,
As we know, gwguru is closing down.
I would like to go back and save all the collection threads and any others that people want in text files so they can be accessed in the future. They have great information.
I'm trying to decide the best way to do this. Lets talk in context of the collections thread.
http://www.guildwarsguru.com/forum/y...t10463144.html
It's 115 pages long so obviously just re-typing every post is not an option. There are pictures too.
I had 2 ideas:
1. If one could pull all the html for every page, it would be 115 html files, couldn't those pages be replicated elsewhere exactly? I can trivially pull the html files for every page using matlab and save them all... Then someone could take those html files and replicate the current threads exactly so the information isn't lost.
2. Again, I can pull the html files for every page of the thread. I could make a computer code that goes through said html file and saves all the post information and discards all the other html garbage and saves it as a text file... This isn't as trivial as just pulling the html to automate it for them all.
Keep in mind I'm no computer expert. But I use computers for work to code stuff and stuff =P.
What are people's thoughts on the best way to save old threads people care about? Are these the best ideas? Am I forgetting something?
Cheers,
Buzan
EDIT::::
I've saved 24 threads so far. In the future I will post which ones at some point maybe. Next will be the ones here.
Last edited by Surge goes pre; May 19, 2016 at 01:48 AM // 01:48..
|
|
|
May 17, 2016, 06:23 PM // 18:23
|
#2
|
Jungle Guide
Join Date: Oct 2008
Location: There
Guild: [ToA]
|
Hey surge, I think that our two reasonable options are either screen shots(if hosted somewhere reliable will outlast the original pictures that we would be linking in the new threads) or your option of actually transporting the webpages.
Depending on work involved with either, and who is capable of either, I think these are the two reasonable options. I think having archival screenshots is an OK and probably easier solution that lasts forever rather than until the original uploads break, but people may want the feeling of scrolling through the same pages again. I'm not sure.
If we wanted to be really hardcore about preserving GW Guru while we're all working towards this, we could do your solution with new uploads for the images. We can assign threads and have people go through and re-upload the old images to new hosts to then be inserted back into the new threads. This is very elaborate of course, but would mean our cherished old threads stay alive forever(or until people lose interest to maintain) instead of slowly decaying as old image hosts die/drop the images/etc.
|
|
|
May 18, 2016, 12:52 AM // 00:52
|
#3
|
Administrator
|
You could take the second option further. If you can strip out all of the html tags and whatnot from the raw thread data and get it into .csv format (or something similarly suitable) you could then make a new (offline) .html file that reads that data and displays it in a format similar to Guru. Preserving the images is a different issue altogether.
Not sure if that would be satisfactory but it would be a viable thing to, given enough know-how.
__________________
|
|
|
May 18, 2016, 02:54 AM // 02:54
|
#4
|
Desert Nomad
Join Date: Mar 2008
Location: I would never play off Occupy Wall St. for my guild name
Guild: We Are the 1 Percent
Profession: Me/
|
Marty,
I've been experimenting.
I can pull the html from a guru page and open it perfectly in firefox or safari. Even the pictures which are embedded from imgur seem to load in.
I think I'm gonna save everything I personally want this way.
Buzan
|
|
|
May 19, 2016, 01:20 AM // 01:20
|
#5
|
Lion's Arch Merchant
Join Date: Sep 2005
Location: Moscow, Russia.
Guild: Random Ascalon Fools, Soldiers of Thunderstorm.
Profession: R/Mo
|
Quote:
Originally Posted by Surge goes pre
I've already posted in multiple places that I'm going to try to preserve the knowledge. And I believe I have the computer know how to save a lot of it. I've already saved 5 threads with 300+ pages of posts to my hard drive by ripping the HTML. These were the 5 threads that I most wanted. That amounted to 50MB of HTML code. I'm happy to fill my 32GB flash drive with guru pages. But...
TELL ME WHAT KNOWLEDGE (THREAD URLS) YOU WANT ME TO SAVE AND I WILL ADD THEM TO RIPPING CODE.
My goal is exactly what you are worried about but no one seems to want to help. Saving the knowledge. So everyone (not pointing this at Max - general statement) help me do it instead of telling people it's a goal. I have a plan - it may not be perfect, but it's the only one I've heard that seems feasible. All I need the URL of the thread you want to save. If no one tells me and no one comes up with a better plan, the knowledge is gone. But at least I have a plan.
Later we can decide how to make the information available again. But at least a copy will be saved!! Then we decide how to make it available. I don't know if the best way is through a drop box, or on legacy or gw2guru, or a different website with only these files.
So far only cosy has asked me to save any thread - the armor dye thread. And it will be added to my code when I'm home later.
As I said, if the community decides to all go to legacy, I go too. If it's split, I'll be split. If it's all to gw2guru, I'll go there. Wherever I go I wanna be the forum historian .
|
Alright, then, here are those I'd wish to see saved in their entirety:
1) Challenge to all the hardcore Pre-Searing people
2) paw·ned²: All-In-One Team Build & Template Editor
3) LDoA under 17 hours
4) Death Emote
5) Borat's Guide To Pugging Correctly
6) GvG - Improvement
7) The essential reminiscing of good times 'thread'...
8) Report of a very dangerous bug
9) do you remember when
10) Players with R15?
11) Lol paragon gvg
12) Ha in crisis
13) Beginner's Guide to Guild vs. Guild Battles
14) Avatar of Lyssa
15) The Aspiring Drunkard's Guide to Binge Drinking
16) Index of Ideas - Check this thread and use SEARCH before posting ANY new thread (this pretty much needs saving of all of the threads contained therein)
17) The best guilds in game
18) Announcing GWLP
19) HA golden ages...
20) Pokemon M Aster's Ele ball on the loose
21) Leeloof got r15
22) Legendary Hero
23) Rank Fifteen
24) GvG Split Idea
25) Rank 15
26) My HA-builds
27) Guide to beating Zergway in HA
28) Rank 15
29) HoH Relic Run
30) Interrupter Bot Program, is it possible?
31) Question on Higher ranked teams
32) [Guide] Guild Wars Errors - Explanations & Solutions
33) What is the best HA-guild at the moment?
34) Interrupter guide? Tips?
35) MOST 6v6 HoH holds ???
36) Famous people and guilds in HA
37) Screenshot of the Rank 12 Emote
38) I *think* they finally fixed the droknar armor in ascalon arena exploit!
39) Hi. Read me before you post stupid monk questions.
40) You have died 0 times @ Hell's Precipice
41) guild wars: is it really all skill?
42) Top Alpha Testers Announced!
43) Happy Birthday Spooky!
44) Omnia Mutantur, Nihil Interit
And thanks in advance for this initiative of yours!
Last edited by Smoke Nightvogue; May 19, 2016 at 05:40 AM // 05:40..
|
|
|
May 19, 2016, 05:37 AM // 05:37
|
#8
|
Desert Nomad
Join Date: Mar 2008
Location: I would never play off Occupy Wall St. for my guild name
Guild: We Are the 1 Percent
Profession: Me/
|
Quote:
Originally Posted by Shasgaliel
|
I think this one is an important one ^
|
|
|
May 19, 2016, 10:59 AM // 10:59
|
#9
|
Academy Page
Join Date: Jul 2009
Location: Finland
Profession: E/
|
Appreciate the archiving efforts immensely, there's so much history on this site that deserves to be preserved. Off the top of my head, this thread in particular comes to mind:
http://www.guildwarsguru.com/forum/w...t10379144.html
140 pages of pure gold, gathered over 6 years, with some absolute classics in there.
|
|
|
May 20, 2016, 07:58 PM // 19:58
|
#10
|
Site Contributor
Join Date: Dec 2005
Location: UK
Guild: [SoF]
|
Does anyone have advice on the best way to save a thread in this way? Is it something that anyone could do to get the threads that they want?
|
|
|
May 20, 2016, 08:37 PM // 20:37
|
#11
|
Forge Runner
Join Date: Mar 2008
Profession: Me/
|
Interesting how IPB (what Guild Wars 2 Guru runs) lets you download topic pages with a simple button, but this installation of vB doesn't.
Last edited by Cuilan; May 20, 2016 at 08:40 PM // 20:40..
|
|
|
May 20, 2016, 09:13 PM // 21:13
|
#12
|
Desert Nomad
Join Date: Mar 2008
Location: I would never play off Occupy Wall St. for my guild name
Guild: We Are the 1 Percent
Profession: Me/
|
Bsoltan, I don't think anyone can do it. At least not my way. I use the 'urlread('http://www.guildwarsguru.com/..........')' matlab function and then fprint to write to file.
|
|
|
May 20, 2016, 10:11 PM // 22:11
|
#13
|
EXCESSIVE FLUTTERCUSSING
Join Date: Mar 2007
Guild: SMS (lolgw2placeholder)
Profession: Me/
|
Quote:
Originally Posted by Cuilan
Interesting how IPB (what Guild Wars 2 Guru runs) lets you download topic pages with a simple button, but this installation of vB doesn't.
|
This version of vB hasn't been modified since before Curse purchased the website.
__________________
All seems lost now, but still we must fight on.
|
|
|
May 21, 2016, 10:09 PM // 22:09
|
#14
|
Lion's Arch Merchant
Join Date: Sep 2005
Location: Moscow, Russia.
Guild: Random Ascalon Fools, Soldiers of Thunderstorm.
Profession: R/Mo
|
Quote:
Originally Posted by bsoltan
Does anyone have advice on the best way to save a thread in this way? Is it something that anyone could do to get the threads that they want?
|
Quote:
Originally Posted by Surge goes pre
Bsoltan, I don't think anyone can do it. At least not my way. I use the 'urlread('http://www.guildwarsguru.com/..........')' matlab function and then fprint to write to file.
|
Here's the link to the program I'm trying to save the entire forum content with at the moment. Not sure how it all turns out in the end, though, since GWG is not simply a regular website, but a database-utilizing PHP application, which makes it much more complicated for the script when it comes to retaining the existing level of data consistency.
|
|
|
May 22, 2016, 10:40 AM // 10:40
|
#15
|
Site Contributor
Join Date: Dec 2005
Location: UK
Guild: [SoF]
|
Quote:
Originally Posted by Smoke Nightvogue
Here's the link to the program I'm trying to save the entire forum content with at the moment. Not sure how it all turns out in the end, though, since GWG is not simply a regular website, but a database-utilizing PHP application, which makes it much more complicated for the script when it comes to retaining the existing level of data consistency.
|
I have been trying the exact same one, takes its time and same as you. I'm not sure what the result will be.
|
|
|
May 22, 2016, 11:50 AM // 11:50
|
#16
|
Desert Nomad
Join Date: Sep 2005
Location: Wakefield, West Yorkshire, Uk, Nr Earth
Guild: Alternate Evil Gamers [aeg]
Profession: N/
|
Same here I have 11GB saved up to now but I haven't checked it over yet Seems to have done the trick though for now.
I have a feeling it's not going to be this simple but having the entire site and forums for offline viewing is awesome
|
|
|
May 23, 2016, 06:37 AM // 06:37
|
#17
|
Site Contributor
Join Date: Dec 2005
Location: UK
Guild: [SoF]
|
It worked! How awesome.
|
|
|
May 23, 2016, 07:11 PM // 19:11
|
#19
|
Site Contributor
Join Date: Dec 2005
Location: UK
Guild: [SoF]
|
Quote:
Originally Posted by bsoltan
It worked! How awesome.
|
Just as well.. turns out that httrack didn't archive everything. It might all be there but isn't easily linked together browsable from the forum.
Maybe with some fiddling. I'm trying WebCopy as well.
|
|
|
May 26, 2016, 06:41 PM // 18:41
|
#20
|
Departed from Tyria
Join Date: May 2007
Guild: Clan Dethryche [dth]
Profession: R/
|
Last edited by Shayne Hawke; May 26, 2016 at 10:30 PM // 22:30..
|
|
|
Thread Tools |
|
Display Modes |
Linear Mode
|
Posting Rules
|
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts
HTML code is Off
|
|
|
All times are GMT. The time now is 12:53 PM // 12:53.
|